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Abstract 

We consider interactions as bidirectional channels. We investigate the capacities for interaction Hamil- 
tonians and nonlocal unitary gates to generate entanglement and transmit classical information. We give 
analytic expressions for the entanglement generating capacity and entanglement-assisted one-way clas- 
sical communication capacity of interactions, and show that these quantities are additive, so that the 
asymptotic capacities equal the corresponding 1-shot capacities. We give general bounds on other ca- 
pacities, discuss some examples, and conclude with some open questions. 



1 Introduction 

1.1 Motivation I — converting given interactions into abstract resources 

The fundamental physical resource for performing various quantum information processing tasks is the 
interaction between various quantum systems. These quantum systems can be, for example, individual 
registers in a quantum computer, or systems possessed by isolated parties. The interactions are Hamil- 
tonians or their discrete time incarnation as unitary gates that are nonlocal with respect to individual 
systems. The information processing tasks of interest include converting a given interaction into a uni- 
versal quantum gate, generating entanglement between remote parties, and communicating classical or 
quantum information. These tasks output what are considered abstract resources in quantum informa- 
tion theory, such as entanglement and classical communication. The study of the conversions among 
these various resources and the efficiencies thereof has proved to be a fruitful field. 

While our knowledge about the optimal use of a nonlocal interaction to provide the derived resources is 
far from complete, important progress has been made. References [0, ^, § provide motivating examples 
on the interconversion of these resources. Interconversion tasks can be classified as follows: 

(1) Simulation of one nonlocal Hamiltonian or gate by another: work described in Ref. Js| presents 
methods for doing so, some of which are optimal under certain circumstances. 

(2) Generation of entanglement using nonlocal Hamiltonians and gates: Partial results are obtained in 
Refs. §, || g. 

(3) Classical (or quantum) communications using nonlocal Hamiltonians and gates. 

(4) Performing a nonlocal quantum operation using entanglement and classical communication: This is 
the converse of the last two tasks. General formalisms for the 1-shot bipartite case and methods for more 
specific gates are given in Ref. [|. 

These tasks are related. First, entanglement, forward classical communication, and backward classical 
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communication are strictly incomparable resources: no one of them can be generated even from an 
infinite supply of the other two. Thus the capacity of a given interaction to create each of the three 
resources cannot exceed the amount used to perform the interaction. For example, the CNOT can be 
simulated using 1 ebitfl, 1 forward, and 1 backward classical bit juj, so that the entanglement capacity 
and both forward and backward classical capacities are upper bounded by 1. Second, the efficiency for 
one interaction to simulate another provides bounds on the relative efficiency for the interactions to 
generate resources. For example, any capacity of SWAP is at most 3 times that of CNOT since the SWAP 
can be written as 3 cnots. 

In this paper, we focus on tasks (2) and (3), and investigate the capacities of a unitary interaction to 
generate entanglement and perform classical communication. The unitary interaction can be a nonlocal 
Hamiltonian or gate. We are primarily concerned with the asymptotic limit, when many uses of the 
gate (or a long duration of the Hamiltonian) are given. We consider an interaction on two d-dimensional 
systems, allow unlimited local operations, local ancillas of arbitrarily large dimensions, and arbitrary 
input state. We give expressions for the entanglement generating capacity and the entanglement-assisted 
one-way classical capacity Jll], [l^, |l^, [lij of an interaction. We show that these quantities are additive 
in the sense that the amount of entanglement or classical communication generated by n uses of a gate 
is n times the amount generated by one use. 

1.2 Motivation II — interactions as bidirectional channels 

The capacities of generating entanglement and communication are well studied in the context of a noise- 
less or noisy channel connecting a sender (Alice) to a receiver (Bob). In this usual model of a quantum 
channel, a quantum system is physically transported from Alice to Bob, with possible changes (noise) 
caused by a quantum operation |l5) (i.e. a trace-preserving, completely positive, or TCP, linear map). 
This model of a channel is unidirectional - Bob cannot send information to Alice. However, such unidirec- 
tional interactions are a special case of quantum interactions, and in general, a quantum system cannot 
affect another without being changed itself. For example, the CNOT (defined in the computational basis) 
operates in reverse direction in the conjugate basis, and transmits an equivalent amount of information 
in either direction when used in conjunction with other local gates. 

In view of this, we generalize the usual model of a quantum channel to take into account the bidirectional 
nature of a quantum interaction. We define a "bidirectional channel" as a bipartite quantum operation. 
Alice and Bob each inputs a state to the "bidirectional channel" and receives an output. This work can 
be viewed as studying the entanglement and classical capacities in bidirectional channels, restricted to 
the unitary case. 

Throughout this paper a protocol means a procedure that uses a nonlocal gate one or more times, or a 
nonlocal Hamiltonian for some total amount of time, possibly also consuming and/or producing various 
amounts of other standard resources, such as entanglement and classical communication in each direction. 
We always allow unlimited local operations, and we are interested in a protocol's net yield (production 
minus consumption) of standard resources per gate use or unit interaction time. The protocol can be 
written as a quantum circuit, and the net effect can be described as a bipartite quantum operation, with 
bipartite input and output. We call this quantum operation the protocol as well. In general there is 
a tradeoff among the yields of various resources when the protocol is varied. For example, CNOT can 
transmit a classical bit in the forward or backward direction, but not both. As back communication is 
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generic in a bidirectional channel, a protocol using it is generically interactive. 

In the next two subsections, we provide more detailed introductions to the two tasks studied in this paper 
and discuss closely related work. 



1.3 Entanglement generating capacity of bidirectional channels 

In Ref. [Q, the quantum communication capacity of a channel is shown to be equal to its capacity for 
generating pure entanglement; a greater quantum capacity typically results if two-way classical com- 
munication is allowed. Likewise, a bidirectional channel (bipartite quantum operation) can be used to 
generate entanglement. Simple examples are considered in Refs. jlj, ^, ^| 0|. Reference Q considers the 
average amount of entanglement created by one use of a nonlocal operation on a distribution of product 
states. Reference |Q classifies the type of entanglement (bound or distillable) that can be created from 
product states. Reference Q considers the optimal 1-shot rate of creating entanglement using an arbi- 
trary 2-qubit Hamiltonian on possibly entangled pure input states without local ancillas. Reference [Hj 
considers the optimal amount of entanglement created by one use of an arbitrary 2-qubit gate on pure 
product input states without ancillas. References |], ^ also exhibit examples in which local ancillas 
increase the amount of entanglement created. 

In this paper, we follow the philosophy of Ref. |l6) and investigate the asymptotic entanglement generating 
capacity of a bidirectional channel acting on two d-dimensional systems. Contrary to previous work |], 
Bj, we do not restrict ourselves to qubit systems, we allow arbitrary local ancillas and input states 
(including entangled or mixed states), and we consider the most general asymptotic protocols. We 
also consider the effect of many auxiliary resources including classical communication. We restrict our 
attention to unitary bidirectional channels. We derive the expression for the capacity, show that it is 
additive, and discuss the optimal protocol. 

Leifer, Henderson, and Linden [ fl7| have independently shown, by similar arguments, that the asymptotic 
entanglement generating capacity on pure input states is an optimization over a 1-shot expression. They 
also investigate the capacities for many 2-qubit gates with low dimension ancillas both analytically and 
numerically. 

1.4 Classical communication capacities of bidirectional channels 

The classical capacity of an ordinary (unidirectional) quantum channel is in general affected by the 
availability of auxiliary resources, such as entanglement and back communication. For a general 
noisy quantum channel, the capacity without auxiliary resources is found in Refs. [l2| , and that with 
unlimited supply of pure entanglement is found in Refs. | Q , Q |. The capacity for a noiseless quantum 
channel with unlimited supply of a certain noisy entangled state is found in Refs. jH], ^o|, ^l], |22| . 

In treating bidirectional channels, we again follow the philosophy of Refs. |n], ^[ Q and consider 
various asymptotic classical capacities of unitary bidirectional channels of arbitrary dimensions. We allow 
unlimited local resources, including free instantaneous local operations, and the freedom for Alice and Bob 
to attach and remove local ancillas. Shared randomness is also given as a resource. Our philosophy is also 
similar to Shannon's study of the classical capacities of classical two-way communication channels |Ejf . 

A new ingredient in the case of bidirectional channels is the simultaneous forward and backward commu- 
nication, resulting in a pair of achievable rates. One can define many classical capacities other than the 
forward and the backward capacities. Generally, there is a tradeoff between the forward and backward 
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rates. 

Our long term goals are to obtain expressions for these capacities, understand the tradeoff between for- 
ward and backward communication, and relate the quantities to other capacities such as the entanglement 
generating capacity. In this paper, we define various asymptotic capacities of bidirectional channels. We 
obtain an expression for the one-way (forward or backward) entanglement-assisted classical capacity for 
any arbitrary nonlocal gate or Hamiltonian, and the protocol achieving it. The asymptotic capacity is 
achieved by a 1-shot expression, as an optimization over input ensembles for one use of the gate. 

We remark that other independent investigations on optimal methods to perform classical communication 
in low dimensions without entanglement assistance are being conducted ^6|. 



1.5 Structure and assumptions of the paper 

In the next section, we discuss in detail the problem of entanglement generation, and derive the ex- 
pression for the entanglement generating capacity for any nonlocal gate. In Sec. ^, we define various 
classical communication capacities, followed by a derivation of the entanglement-assisted one-way classi- 
cal capacity for any nonlocal gate in Sec. ^. We discuss the similarities and differences between the two 
derivations in Sec. ^. In Sec. ^| we prove various general bounds relating the capacities for entanglement 
generation and classical communication. We conclude in Sec. ^| with open questions and examples of 
unitary bidirectional channels. 

Throughout the paper, we assume the following. Unless otherwise noted, logarithms are in base 2. 0(n) 
and o(n) respectively denote functions linear and sublinear in n. U denotes a nonlocal gate acting on 
two d-dimensional systems (with shorthand d x d) in the possession of Alice and Bob. They have access 
to the following local resources: 

Local ancillas of arbitrarily large but finite dimensions and unlimited local operations. 

We do not consider ancillas of infinite dimensions and do not know if they can be more useful. 

Though we have motivated the discussion with both Hamiltonians and gates, we now argue that it is 
sufficient to focus on gates only. This is because Hamiltonian capacities are simply gate capacities in the 
limit of infinitesimal gates, so that any Hamiltonian capacity can be obtained from the corresponding 
gate capacity. A protocol using a Hamiltonian is similar to one using a gate, with additional freedom 
on how long each free Hamiltonian evolution can last before being interspersed with local operations. 
However, different durations of evolution are simply concatenation of different numbers of infinitesimal 
ones. Thus any Hamiltonian capacity Gh can be expressed in terms of the corresponding gate capacity 
Gu: . 

Gh — lim — Gjj— p -iH S . 

s-»0 S 



2 Entanglement capacity of bidirectional channels 
2.1 Main idea 

Before a formal treatment of the entanglement capacity, we first illustrate our central idea with the 
following example. Let E e be the entropy of entanglement [Q. Suppose the goal is to increase E e as 
much as possible. Different uses of U can be used sequentially or in parallel, and be interspersed by 



4 



LOCC (Local Operation and Classical Communication). We allow an arbitrary pure input state with 
ancillas, possibly entangled over different uses of U . What is the optimal strategy? The answer turns 
out to be very simple. Consider the quantity 

AEu = sup E e (U AB |V>) aba'b' )~ E b (\iI))aba>b'), (1) 
H') aba'b' 

which represents the entanglement generated by optimizing the input state for one use of U. Let 
IV 1 *) aba'b' attain the supremum. Then applying individual uses of U to copies of aba'b' is asymp- 
totically optimal. This is because the total increase in E e in any asymptotic protocol is at most the sum 
of the increases due to each use of U, each is no greater than AEu- 

In the following, we will develop this idea rigorously in the most general setting. We consider mixed 
input states and different entanglement measures, and analyze the roles of various auxiliary resources. 



2.2 Definitions and summary of results 

The entanglement capacity of a gate U can only be defined when the entanglement measures for the 
input and output and the available auxiliary resources are specified. 

Traditionally, entanglement is a qualitative phenomenon. The theory of quantifying entanglement is not 
complete, though much progress has been made J27J (Refs. J2^, |2t| give informative reviews). Based on 
the transformation properties of entangled states, measures of entanglement are defined which are very 
different in the asymptotic and nonasymptotic regimes. Different measures in the same regime can also 
be inequivalent. 

The entanglement generated by a protocol V on an input p is intuitively 

E out (V(p)) ~ E in (p) (2) 

where Ei n , Eout are the input and output entanglement measures specified in the problem. We can now 
define the entanglement capacity of U: 

Definition 1 The f-shot entanglement capacity of U is the maximum amount of entanglement 
generated per use of of U by any protocol Vt that uses U t times, auxiliary resources labelled 



by r, and local resources specified in Sec. 1. 1 (ancillas of arbitrary but finite dimensions and 
unlimited local operations). We consider two possible f-shot capacities, depending on the 
allowed input state: 

1. when the input is restricted to be a product state (without loss of generality 1 00) , since 
Alice and Bob can locally transform any product state into any other product state): 

Et^l,v = sup j [ Eout(7M|00><00|)) - £U|00><00|) ] (3) 
v t 1 

where the superscript denotes "starting from nothing", and 

2. when there is no restriction on the input state: 

E^ZoL.u = ^P sup ~ [ E mt (V t (j>)) - E in (p) ] (4) 
v t p t 

where the superscript * denotes an optimization over all possible input states. 
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The corresponding asymptotic capacities are: 



^in— out,E7 = } lm ^in-^ut.E/ ) ^in^out,!/ = } lm ^in-out.P ■ l°J 
t — >oo t — >oo 

In our notation, we assume that an entanglement measure is written as E x where the subscript x labels 
the measure. For example, in Ei n , Eout, and the notation for the above capacities, the "in" and "out" are 
placeholders for the entanglement measures being referred to. Whenever Ei n = -Eout ~ E x we simplify 
the notation of the capacity to E^'Q . Finally, an arbitrary entanglement measure is written as E without 
the subscript, and the capacity is written as Ejj"'. 

By definition, E^^'out u — ^in^'out v '^ ne ca P ac ity -^in^'out u nas an operational meaning that a supply 
of the initial state p is available at a price E m (p). This is a resource, because the ability to create p with 
an average cost E- ln (p) is generally not guaranteed (unless E- m is the entanglement cost j30|). We refer to 

this as "the resource *" throughout the paper. In contrast, no such resource is assumed in the capacity 

p(t,0,r) 

- Cj in->out,!7- 

Since we are interested in asymptotic capacities, we are primarily concerned with asymptotic measures. 
These include the entanglement cost E c |^] and the distillable entanglement Ed [0 . We also study the 
entanglement of formation Ef which is closely related to E c . All of these measures coincide with 
the entropy of entanglement E e on pure states. As our results apply to more general measures, and may 
be useful in other contexts, we follow an abstract approach psf , which requires more technicalities 
in our arguments. However, the essence can be made clear by relating to our simplified example, and we 
leave this step as an exercise to the readers. 

The auxiliary resources can be divided into three types according to their quantities. The first type 
is given in an amount that is negligible or can be recovered at the end of the protocol. For example, 
sublinear (in the number of uses of U) amount of resources in the asymptotic case are negligible, and 
catalytic resources in the 1-shot case are used and regenerated (for example, see Ref. ]3l]]). However, we 
need not consider these resources. In the 1-shot case, catalytic resources are a subset of the resource * 
and need not be treated separately. In the asymptotic case, sublinear amount of any resource can be 
produced at a vanishing average cost and does not affect the asymptotic capacity. This is because any 
nonlocal gate has nonzero capacity to create pure entanglement and to perform classical communication 
(see Sec. ^ from which any other resource can be produced. The second type of resources are at least 
linear in the number of uses of U. To consider these resources is an important open question, but it is 
out of scope of the present paper. The third type of resources are unlimited and free. In the context of 
generating entanglement, we focus on the auxiliary resource of unlimited 2-way classical communication, 
labelled by "cc". 

Our results can be summarized in terms of the entanglement capacities just defined. 



• In Sec. 2.3 we show that if Ei n = E out = E and E is nonincreasing under LOCC E { J'*' cc) > E ( J tCC K 



Thus given the resource *, the 1-shot capacity is no less than the asymptotic capacity. We give 

-,(l,*,cc) _ ,-,(*, 



a sufficient condition for additivity, E^'*' cc ^ = Ert' cc \ and we describe an optimal asymptotic 



protocol that does not require classical communication (thus E^'*' cc ^ — E^'*^). Additivity holds 
for many measures including E = E c ,Ef,Ed- 



• In Sec. 2.4 we consider E = E c , and we show that E c = E c (j = E c L by describing an explicit 



protocol. We show the same for Ef. In other words, * or cc does not increase the asymptotic 
capacity of E c j, which can be attained without classical communication nor a supply of the optimal 
input. 



In Sec. 2.5 we consider the maximum gain of pure entanglement. This is given by E^^ , and we 
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show that it is equal to E^'^. Thus, the optimal protocol in Sec. 2.4 applies, without the need 
of resources * or cc. 

2.3 Expression for £^*' cc ^ when E in = E ont = E 

Throughout this subsection, E[ n = -Bout = E and both resources *, cc are available. 

Let |$ n ) = ]Cr=i \^)a\^)b be the n x n maximally entangled state shared between Alice and Bob. 

Unless otherwise stated, E satisfies the following assumptions, but is otherwise arbitrary: 

Al. E = for product states. 

A2. E is invariant under local unitaries. 

A3. E is nonincreasing under LOCC. 

A4. V p J5(p®|#„)<*n|) =E(p) 

Al-3 are basic axioms for entanglement measures, while A4 is needed to define the "net" amount of 
entanglement generated by a protocol. Generally, we do not assume E is normalized (V„ E(\<& n )) = logn) 
and will state the assumption explicitly when it is needed. 

We first state a lemma based on the following simple observation j^, [i| . Alice and Bob can implement U 
if Alice teleports her input to Bob, who applies U locally in his own laboratory and teleports her output 
to her. This consumes two copies of and 21ogd bits of classical communication in each direction. 

Lemma 1: Vt E^'*' cc) < 2E(\<S> d }). Thus E^ cc) < 2£(|$ d }). If E normalized, E^ cc) < 21ogd. 

Proof: For any protocol with t uses of U and LOCC, modify it by replacing each use of U with its 
double teleportation implementation. Let pi n and p out be the input and output of the original protocol. 
The modified protocol uses only LOCC and has input pi n CB> |4 > d)(4 > d|® 2t and output p ou t- Applying A3-4 
to the modified protocol, 2tE(\$ d )) < E(p out ) - E(p ln ). □ 

We now proceed to prove Theorem 1, which says that the asymptotic capacity is equal to the 1-shot 
capacity given the resources * and cc. This is done by proving two separate inequalities, each is referred 
to as a half of the Theorem. 

Theorem 1 (1st half): E$'*' cc) < E$'*' cc) . 

Proof: Since LOCC operations cannot increase entanglement, the best 1-use protocol has the form: 

A' 

A — 
B — 
B' — 



u 



(6) 

The only optimization is over the initial state, and thus 

E (X*,cc) = gup e(Uab paa'bb< U\g) - E(paa'bb') ■ (7) 

PAA'BB' 

In Eq. (^) and throughout the paper, the subscripts of an operator denote the systems being acted on. As 
an aside, e£>*> cc) = since no classical communication is used in the protocol depicted in Fig. (ph. 

Now, consider any protocol with LOCC and t uses of U. Without loss of generality, we can divide the 
circuit into time steps each having either 1 use of U or only LOCC operations. The entanglement can 
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only increase in the t time steps with U and each is described by Fig. (g), by defining the ancillas A' and 
B' to include all registers not acted on by U in that time step. Thus, the total amount of entanglement 
generated by the protocol is no more than tE ( J'*\ and E ( J'*' cc) < = E$'*> cc \ □ 

We now consider sufficient conditions for additivity, E^'*' cc ^ = Ejj'*' cc \ We say that E is weakly additive 
on p if E(p® n ) — nE(p). E is weakly additive if it is so on all p. We say that E is strongly additive on 
pi ifV P2 E(p-i® p 2 ) = E(pi) + E(p 2 ). E is strongly additive if V P1 , P2 E(pi ® p%) = E(pi) + E(p 2 ). Weak 
and strong subadditivity and superadditivity are defined by replacing the equality in the corresponding 
additivity definitions by the inequalities < and > respectively. 

Theorem 1 (2nd half): If E is weakly additive or subadditive on the optimal input in Eq. (ffl), and is 
weakly additive or superadditive on the optimal output, £^>* ,cc ) > E^'*' cc \ 

Proof: Consider the t-use protocol that repeats the 1-use protocol in Fig. (|^) t times, each on a separate 
copy of the optimal input. The entanglement generated is at least tE^'* , and E^'* ,cc ^ > E^'*' cc \ d 

Note that any measure which is weakly additive and is nonincreasing under LOCC satisfies both halves 
of Theorem 1 and E^' cc ^ — E^'*' cc \ Weak additivity is not an axiomatic property of entanglement, and 
need to be checked for individual measures before applying the 2nd half of Theorem 1. On the other 
hand, Theorem 1 does hold for most commonly used measures. Examples include E c , Ed p^], and the 
Renyi entropy (which includes the mixed state generalizations of the logarithm of the Schmidt number 
and E e as special cases). It is an open problem whether Ef is weakly additivity, however, we will prove 
that the second half of Theorem 1 still holds for E — Ef. 

In general, we say that "Theorem 1 holds" whenever both halves of Theorem 1 hold. Eq. (Q) then provides 

f# cc) 

an explicit expression for E v ' achieved by repeating the 1-use protocol. In fact, the protocol only 
requires a supply of the optimal input (the resource *) but not classical communication, and Ejj ,cc ' = 
Ejj\ In the following, we prove some lemmas for E^'*' cc \ We discuss how to obtain this supply of 



optimal input in Sec. 2.4 



In the expression for E^'*'^ in Eq. (Q), the supremum is taken over finite but arbitrarily large dimen- 
sional ancillas A'B' . This can also be viewed as a limiting quantity as the ancilla dimensions increase. 

Lemma 2: Suppose we restrict to n- dimensional A', B' in Eq. (^), and denote the subsequent maxi- 

(1 * cc) 

mization by e„. Then, lim„-,oo e„ = E v ' ' . 

Proof: The sequence {e n } is increasing and bounded above by Lemma 1. Furthermore, V £ >o 3n such 
that e„ D > Eft'*'™^ - e. □ 

Lemma 2 provides one possible way to estimate the supremum in Eq. (Q) suitable for numerical ap- 
proaches. 

(1 * cc) i* 

We say that "iS^ ' ' — e can be attained on an input p if the 1-shot protocol generates an amount of 
entanglement E^ , *' cc ' 1 — e on the input p. The next 2 lemmas show that the optimal input for the 1-shot 
capacity in Eq. (Q) can be chosen pure for the specific measures Ef and E c . 

Lemma 3: For any e > 0, E^ 1 ^'^ — e can be attained on a pure input state. 

Proof: Let Paa'bb' be a state attaining the supremum in Eq. (Q) to within e. We omit the system 
label when it is AA'BB' in Lemmas 3-4. Let p = ^\ \i\ipi){ipi\ be an optimal decomposition so that 
Ef{p) = Y.^Ef{\^)). Then, 

E%*' cc) -e < E f (U AB P U\ B )-Ej{ P ) 

< Y, A * ( Ef{UAB\i>i)) - E f {\^)) ) < max ( E f {U AB \i>i)) ~ E f {\^)) ) . (8) 



8 



The second inequality is obtained by applying convexity of Ef to the first term, and the definition of the 
optimal decomposition in the second term. Thus, E^^ — e can be attained on a pure input state, d 

Lemma 4: For any e > 0, E^£ — e can be attained on a pure input state. 

Proof: Let p attain the supremum in Eq. (R) up to e/2. That is, 

E%*' cc) ~^ < E C {U ABP U\ B )-E c {p). (9) 

For any 5 > 0, 3 m such that ±-E f (p mm ) - E c (p) < 5 ffl. Substitute this into Eq. M with S = e/2, 



E 



(l,»,cc) 



\ < E c {Uab p U ab ) - ^Ef(p® m ) + | . (10) 
Using weak additivity of E c and the fact Ef > E c , the first term in the RHS of Eq. ( |Io[ ) can be rewritten: 

E%'^~e < ^[Ef{UtZp® m UZ m )-Ef( P m )] ■ (11) 

But the expression in the bracket represents the entanglement of formation generated by a certain m-use 
protocol, and is no greater than mEf^' 00 ' by the 1st half of Theorem 1. Together with Lemma 3, 

E%*' cc) -e < sup E f (UABW)-E f (W). (12) 

Finally, we replace Ef by E c on the RHS since they coincide on pure states, 

£(M,cc)_ £ < gup E B (U AB \il>))-E e (\1>)), (13) 
IV-) 

which proves our claim. Q 

When Theorem 1 holds, lemmas about E^j'*' cc ^ hold for _E[j*' cc ' as well. This fact, together with Lemmas 
2-4, lead to many useful corollaries. These are given with numbers matching those of the corresponding 
lemmas. 

Corollary 2: If Theorem 1 holds for E, then _e[ 7 *' cc - ) is achievable using the 1-use protocol in Fig. (^) 
with sufficiently large dimensional A' , B' . 

Corollary 3.1: Since Ef is strongly additive on pure states, Lemma 3 implies that Theorem 1 holds for 
Ef- 

Corollary 3.2, 4.1: Corollary 3.1 and Lemma 4 imply that V £ >o, E^*^ — e and E^*^ — e are each 
attainable with the 1-shot protocol in Fig. (^) on a pure input state. 

Corollary 3+4: E^ 1 '^^ = E^ff' cc ^ with common pure optimal inputs. Same for E^*^ = E^*^ c \ 

It is unclear whether E^'^'^ — e can be attained on a pure state. Convexity in Ef sC is required in our 
proofs of Lemmas 3 and 4, but unlike Ef and E c , Ed may not be convex j§3|. 

Note that Theorem 1 is concerned with weak additivity of the entanglement capacity of bidirectional 
channels, i.e., the protocol uses only one type of nonlocal gate. We can consider strong additivity when 
different types of nonlocal gates are available: 



Theorem IS (1st half): For a protocol with n, uses of the gate Ui, the maximum amount of entan- 
glement generated (given *, cc) is no more than Yli^^u- • 

Theorem IS (2nd half): If E is strongly additive or subadditive on the optimal input and strongly 
additive or superadditive on the optimal output for each Ui, then repeating rii times the 1-shot protocol 
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(l,*,cc) 



for Ui for each i generates an amount of entanglement no less than £\ TiiE l 

In particular, Theorem IS holds for E — Ef. c , and the entanglement capacities are strongly additive 
given * and cc. 



2.4 Auxiliary resources are unnecessary when E = E c or Ef 

In this subsection, we show that the resource * is unnecessary in the optimal asymptotic protocol in the 
previous subsection (repeating the optimal 1-shot protocol) for the specific measures E ln — E out = E c j. 



of entanglement E^*^ — e — E^. 1 ^"'^ — e can be generated by adapting an argument in ||. The protocol 
first creates m copies of the pure optimal input |V')aa'sb' (inefficiently), and then repeats the cycle: (1) 



By Lemma 4, when E = E c , the optimal input and output of the 1-shot protocol are pure. The amount 

J c,U 

l„/,', 

I AA'BB' 

apply U® m , (2) "concentrate" ^] the outputs to EPR pairs and (3) dilute some of the EPR pairs to form 
fy) AA'bb' fO' S 0' ^ or l ar S e m > dilution and concentration take 0(y/m) classical communication 
and waste 0(y / m) amount of entanglement |i6| , |37[ , both can be supplied by an additional 0(y / m) uses 
of U - negligible for sufficiently large m. The cost of creating the first m copies of the pure optimal input 
inefficiently is also negligible when the cycle is repeated sufficiently many times. The same argument 
holds for E = E f . 

Corollary 3.3, 4.2: E { *£ c) = E^% and E f *£ c) = E f %. 



The asymptotic entanglement capacity for E — E c j under the most general setting in Sec. 2.2 can be 
generated with no initial entanglement and without * nor cc. The core part of the optimal protocol is 
basically 1-shot - tensor product of the optimal 1-shot protocol. The only collective steps, entanglement 
concentration and dilution, are auxiliary. 

Since no initial entanglement is required for the optimal asymptotic generation of Ef and E c , one can 
relate the asymptotic entanglement capacities to the Schmidt number of U. Any bipartite pure state can 
be written as Y2i -M</>i) A\f}i) b where Xi > 0, Y2i Af = 1 and are orthonormal sets of states. 

Likewise, any d x d bipartite unitary gate can be written as Y2i ^iViA ® WiB where Xi > 0, Y2i A? = d 2 
and {W^b} are sets of operators orthonormal under the trace norm. The Schmidt number of 

the bipartite pure state or gate |iH[ denoted as Sch(-), is the unique number of terms in the above 
"Schmidt decomposition." The are called the Schmidt coefficients. We will repeatedly use the fact 
that the Schmidt number of a state is nonincreasing under LOCC and that Sch((7|i/>)) < Sch([7) Sch(|i/>}) 
(see Ch. 6.4.2 of @). 

Corollary 3.4, 4.3: E { *£ c) = E f *£ c) < log Sch(£/). 

Proof: Since E^ c) and E ( *£ c) can be achieved without initial entanglement, the initial state has 
Schmidt number 1, and the final state of a t-use protocol has Schmidt number < Sch(Z7) . Hence, the 
output entropy of entanglement is < t log Sch((7), and E^*^ = E^*^ < log Sch((7). D 

Corollary 3.5, 4.4: E ( c ^ } = Ef ^ > - £\ X 2 log X 2 where {Xi} are the Schmidt coefficients of U. 

Proof: This is the entanglement generated when \iJj)aa'bb' = \^d) AA'\^d) bb' in Eq. ([?]) |39| . d 

Interested readers can repeat the above analysis for other measures. It holds for E — Ed if the opti- 
mal input p is pure or if p satisfies E c (p) — Ed{p) (by replacing concentration with distillation of the 
optimal output and replenishing the optimal input p using Ed(p) EPR pair per copy of p and classical 
communication (see Appendix [X]) ) . 
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2.5 Different input and output entanglement measures 



Each choice of entanglement measures for the input and output can be given an operational meaning. 
We consider the important example of creating EPR pairs in this subsection, which requires different 
entanglement measures for the input and output. Alice and Bob fabricate the possibly mixed optimal 
input state and distill entanglement from the output. Thus, the appropriate choices for the input and 
output entanglement measures are the entanglement cost E c and the distillable entanglement Ed^ Let 
Vt denote an optimal f-shot protocol and the corresponding quantum operation, and let p be the optimal 
input to within e (again we omit the system label AA'BB'). Then 

E^-e < \[E d {V t {p))-E c (p)} < ~ [ E c (Vt(p)) - E c (p) ] 

< sup E c {Uab\i/>)) - E c (\i/>)) (14) 

= sup E d (U AB \i>}) - E c (\i>)) 

w 

*d,U 



where we have used Corollary 4 in Sec. 2.3 to obtain Eq. (|l4j). This means that the asymptotic capacity 
to create EPR pairs is 

E<£$ = sup E c {Uab\iP)) — E c (\il>)) (15) 
IV>) 



and the protocol in Sec. 2.4 is optimal for creating EPR pairs even in the most general setting described 



in Sec. 1.1 



Furthermore, since the optimal output is pure, and Ed, E c are strongly additive on pure states strong 
additivity (Theorem IS) holds when different types of gates are given. 

2.6 Summary 

We summarize our results obtained so far: 

1. E { f'* ] = £g'*' cc) > E^' cc) for all E. 

2. -By'*' = -E[^' cc ' = E^ for all E weakly additive on the 1-shot optimal input and output. 

3. E ( f'* ] = E^ ] for E = E cJ . 

In particular, when E- ln = Eout = Ef, or when Ei n = E c , and E ut = E c or Ed, the asymptotic capacities 
become independent of the availability of * and cc, and they are all equal to E^ = AEu in Eq. <JTJ> . 
The only capacity mentioned above that is different from AEu is EQ® 1 t v We wm study these two 
capacities in Sec. ^ and Sec. |§| in more detail. 

As an aside, when E = E c (or Ef), if E {1 ^ ] < E^, then E {t '® ] < E W V for all finite t. This is because 

tE im < E cm + {t _ 1)jB c : -m) = E cm + {t _ 1)E m < tE m . 



Note that t is finite, but we have chosen the asymptotic measures E c and Ed- We are mainly interested in protocols with 
large t, with the understanding that the 1-shot capacity E£!?] y is achieved with collective pre- and post-processing. 
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3 Classical capacities of bidirectional channels 



If Alice and Bob have access to a nonlocal gate U to couple their systems, then the classical communication 
capacity of U is the maximum asymptotic number of classical bits that can be reliably transmitted per 
use of U. Communication can be achieved simultaneously in both directions, with possible tradeoffs. 



Free local resources as stated in Sec. 1.5 and shared classical randomness are always allowed. 



In the context of classical communication, the most important auxiliary resource is free entanglement. 
Communication is called "assisted/unassisted" when the resource is/not available. 

The most general protocol (see Sec. 101 ) with t uses of U can be represented as: 



A' 

A |Mi) 

B \ M *) 
B' 

(16) 

In Fig. (|l6|), A and B label the systems acted on by U during the protocol and the systems carrying 
the classical messages before and after the protocol, while A' and B' label the rest of Alice and Bob's 
systems. The dimensions of A and B are converted to d by the initial operation Aq ® Bo and are further 
converted by the final operation At ® Bt- The freedom to apply U to any register is included as swap 
operations in Ai and Bi . Without loss of generality, the local operations Aj , Bj can be assumed unitary, 
since measurements can be deferred until the end of the protocol. In fact, no measurements are needed, 
except for the final readout of the transmitted messages. In the unassisted case, the initial ancilla state 
I'i/'anc) a'b' can be taken to be |0r) A / |0r) s / with random r, which can generate an arbitrary separable 
state. In the assisted case, \tp an c)A'B' can be taken to be the maximally entangled state, which can 
generate an arbitrary entangled state. 

Each protocol allows a certain amount of forward and backward classical communication, giving a pair 
of achievable rates (R->, -R<-) for the gate. 

Definition 2 A pair of rates (R^, R^) is said to be achievable by a gate U if it is possible to 
intersperse t uses of U with local unitaries Aj®Bj, such that an ni-bit message Mi from Alice 
to Bob and an n2-bit message M2 from Bob to Alice are communicated with high fidelity, and 
_R_» > ni/t, i?_ > n-zjt. Mathematically (i?-», R^) is achievable if: 

Ve, 3t, 3n!>tR-,, 3n 2 >tR^ , 3{Aj ® -Bj}* =0 , 
s.t. VMi € {0, l}™ 1 , VM 2 G {0, l}" 2 , 

|Vw) = (At ® B t ) U ■ ■ ■ U (Ai g> B x ) U (A ® B ) (|Mi)a \M 2 )b |i/w}.4',b') 
and F( |M 2 )a® \Mi)b , ti A > B , |^out)(^out| ) > 1-e (17) 

In the above definition, the fidelity F between two states an d P is given by (i/>|p|V'} (this is a 

simplified expression when one of the states is pure). 

We first discuss unassisted capacities, and the assisted capacities are defined in exactly the same way. 
Each gate U defines a region of achievable unassisted rate-pairs (R^,, R^). The region is convex by using 



A 



Bo 



V 



Ai 



U 



A, 



B t 



|M 2 ) 
I Mi) 
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mixed strategies. Furthermore, if (R^,R^) is achievable, so is any (R'_>,R'^.) where R'_> < R^ and 
R'^_ < In particular, the boundary of the achievable region never has positive slope (see Fig. 
Thus, the forward and backward capacities can always be achieved at the boundary points, and can be 
defined respectively as 

C^,u — sup{i? : (R, 0) is achievable by U} 
C^,u ~ sup{i? : (0, R) is achievable by U} 

We can also define various bidirectional capacities, for example, the duplex and the total capacities: 

C<->,u = sup{7? : (R,R) is achievable by U} 

C+,u = sup{7?^ + ii-> : (R<-,R^) is achievable by U} 

We omit the subscript U when the notation is too cumbersome. The following is a schematic diagram for 
the achievable region and the definitions of the various capacities. We present all the known properties 
and intentionally show the features that are not ruled out, such as the asymmetry of the region, and the 
nonzero curvature of the boundary. 




There are much simpler examples - the unassisted achievable region for CNOT and SWAP are similar 
triangles with vertices {(0, 0), (0, 1), (1, 0)} and {(0, 0), (0, 2), (2, 0)} respectively (see Sec. ||). 

In general, little is known about the unassisted achievable region of (J?_,,ii_) besides the convexity 
and the monotonicity of its boundary. The most perplexing question is perhaps whether the region has 
reflective symmetry about = i?<_, which implies C_> = and C+ = 2C„. Refs. [^, [)| show 
that any two-qubit gate or Hamiltonian is locally equivalent to one with Alice and Bob interchanged, 
so that the achievable region is indeed symmetric. This implies the conjecture in Q that the one-shot 
forward and backward unassisted capacities are equal. In higher dimensions, shows that there are 
Hamiltonians (and so unitary gates) that are intrinsically asymmetric. However, it remains open whether 
the achievable rate pairs are symmetric, or more weakly, whether = or C+ = 2C«. 

Assisted capacities CE,y, C^_ tU , C^. jj, C+ u can be defined in exactly the same manner, now the ancilla 
|i/'anc} J 4'_B' is maximally entangled instead of being lOr}^' |0r) s / in the definition of the achievable rate 
pairs in Eq. (|l7|). The properties and open questions of the achievable region are also the same as those 
in the unassisted case. Two simple examples are the assisted achievable regions for CNOT and SWAP, they 
are similar squares with vertices {(0, 0), (0, 1), (1, 0), (1, 1)} and {(0, 0), (0, 2), (2, 0), (2, 2)} respectively 
(see Sec.|). 

Entanglement assistance greatly simplifies the analysis of the classical capacities Ce of the usual (uni- 
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directional) quantum channels.^] An expression for Ce has been found and proved to be strongly ad- 
ditive iflil , The study of Ce also provides useful upper bounds for the unassisted capacities and 
insights to the classification of channels In the next section, we derive a simple expression for C!t t u 
and G^Lu, the 1-way (forward or backward) entanglement- assisted capacity of any bidirectional channel. 
Surprisingly, this capacity is also strongly additive, as in the unidirectional case! 

Comparison of the two problems of generating entanglement and classical communication will be given 
in Sec. H, and the two resulting capacities are related in Sec. fj. 



4 Entanglement-assisted one-way classical capacity 
4.1 Preliminaries and definitions 

In this section we derive expressions for C^y and C^L V , as defined in Eq. ( JTt| ) with ji/'anc) being a 
maximally entangled state. Without loss of generality, we focus on CE, v . It can be evaluated using 
the general framework of 1-way classical communication with quantum resources |^, [y], [l^]. In this 
framework, suppose classical messages i, occurring with probabilities p i: are encoded in the "signal states" 
r/i received by Bob, forming an ensemble £ = {pi,rji}. The information on i obtained by measuring a 
signal state is upper bounded by the Holevo information X for the ensemble £ , defined as 

X(£ = {p i ,r l i} ) = S (J>i7?i) - . (19) 

i i 

The Holevo-Schumacher- Westmoreland (HSW) Theorem states that this amount of mutual information 
per signal state is achievable given the ability to transmit an asymptotically large number of signal states. 
(See (nlpl and Ch. 12.3.2 of O].) 



We will see that the optimal methods to generate EPR pairs (see Sees. 2.4-2.5) and entanglement-assisted 
classical communication have many similarities. The respective goals are to maximize the increase in 
entanglement and the Holevo information. The optimal asymptotic strategies in both cases are to repeat 
the 1-shot protocol, with an optimal input state in the former and with an optimal input ensemble 
in the latter. In the case of entanglement generation, allowing the most general 1-shot optimal input 
with arbitrary ancillas and initial entanglement makes the 1-shot capacity equal to the asymptotic ones. 
Likewise, we will allow the most general 1-shot input ensemble for assisted classical communication, and 
will show that the resulting 1-shot capacity is equal to the asymptotic capacities by establishing a method 
to "replenish" the optimal input ensemble (analogous to concentration and dilution in entanglement 
generation) . 

Let £ = {pi, \ipi) aa'bb'} be an ensemble of bipartite states. A trace-preserving operation acts on £ 
by acting on each component state (preserving its probability). For example, we will write U£ = 
{Pi, UabI^aa'bb'}, trAv£ = {pi,tr AA i\ip i )('ilJi\ AA i BB i}, andtiAA'U£ = {pi, tr^/ Uab |V>;}(V>i I aa'bb' 



We have the following definitions analogous to those in Sec. 2.2 



Definition 3 The i-shot Holevo information capacity of U is the maximum increase in Holevo 
information per use of U due to any protocol Vt that uses U t times, the auxiliary resources 



labelled r, and the local resources specified in Sec. 1.5. There are two possible t-shot capacities 



3 



Note that the resource E appears as a subscript of the capacity for a unidirectional channel, and as a superscript for a 
bidirectional channel. 
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depending on the allowed input ensembles: 

1. when the input ensemble So is restricted to satisfy X(tiAA'£o) = 

Ax (t,0,r) = gup gup l X ( trAA , Vt £ ) (20) 

V t £ 1 

2. when the input ensemble £ is unrestricted: 

AX^*£} r) = sup sup i [ X{tr AA.VtS) - Xitr^S) ) (21) 
v t e t 

Since we always assume free entanglement as an auxiliary resource, and we always focus on forward 
capacity, we omit r = E and — > in the above notation: 

Ax (M) = Ax WM?) s and Ax (t,.) _ Ax (t,* }B ) (22) 

We have 

C*,p = sup Axp V) (23) 
t 

Note that it is unnecessary to consider mixed state ensembles in Eq. (^o|) and Eq. ( |2~lj ) - we can replace a 
mixed state Paa'bb' by its purification aa'A"bb' , where A" is the purifying system, without affecting 

In the next two subsections, we will prove Ct, v = AX^ 1 '*'. We first prove that Ct,v < Ax { u'*\ and 
then we describe a protocol to achieve the upper bound, thereby proving additivity and providing an 
optimal asymptotic strategy. 



4.2 An additive upper bound 

We first prove an analogue of Lemma 1: 
Lemma 5: C5,c/ < 21ogd and C^,u < 21ogd. 

Proof: Consider a t-use protocol. Replace each use of U by double teleportation (see Lemma 1). If 
the original protocol consumes and produces Ci n and C ou t bits of forward communication, the modified 
protocol consumes and produces Cm + 2 1 log d and C ou t bits of forward communication. By causality ^] 
of the modified protocol 2 1 log a! > C ou t — Ci n . Hence CzL, u < 2 log a!. Similarly C^L u < 21ogd. (Note 
that the above proof is stronger than we need, since we have allowed Ci n 7^ 0.) d 

Consider the best 1-shot protocol to increase the Holevo information. Since local operations do not 
increase mutual information, the optimal 1-shot protocol is to just apply U, as in Fig. (^). Thus 

AX^ 1 '*' = sup [ X(t fAV US) - XfaM'S) ] (24) 

£ 

where the supremum is over the most general bipartite pure state ensemble £ = {pi, \ ipi) aa'bb'} ■ 

We now consider the asymptotic problem. Using the same idea that proves Theorem 1 (1st half), we 
obtain the following analogue. 

Theorem 2 (1st half): Ax^'*' < Ax^ 1 '*' . 

Proof: Consider the most general protocol Vt with t uses of U (such as depicted in Fig. (p^)). Let £ 
be an arbitrary bipartite input ensemble. Then, the total increase in X is upper bounded by the sum of 
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the stepwise increases. Since local operations cannot increase X, and the increase in X by each use of U 
is bounded by Eq. (^ ) , 

XitiAA'TtS) - X(ti aa,£) < t Axfr^ , (25) 

from which the Theorem is immediate. □ 

It follows from Eq. @ and Eq. Q that C*5 < AX^'* . 

4.3 Protocol to achieve the upper bound of C£ v 

In optimal asymptotic entanglement generation, the following basic cycle is repeated: 

(1) convert EPR pairs into n copies of the optimal input state, 

(2) apply the gate to each, 

(3) convert the n copies of optimal output state into EPR pairs. 

More EPR pairs are obtained in (3) than used in (1) - as excess entanglement generated. 

In entanglement-assisted classical communication, we want a similar basic cycle: 

(1) convert classical communication to create n states drawn from the optimal input ensemble, 

(2) apply the gate to each state, 

(3) convert the states from the optimal output ensemble into classical communication. 

Step (1) is called remote state preparation [ pH| |l4| , ^Bj (RSP), a procedure whereby Alice helps Bob to 
construct quantum states of her choice in his laboratory using entanglement and classical communication. 
In RSP, Alice performs a measurement on her half of the shared entangled state, sends the outcome to 
Bob, who conditioned on the outcome operates on his half of the shared entangled state to complete the 
RSP. It is known how to approximately prepare n pure bipartite states from an ensemble £ with free 
entanglement and nX{tilAA'£ ) + o(n) bits of classical communication. Step (3) follows from the HSW 
Theorem: Alice can communicate « nX(tXAA'U£) — o(n) bits to Bob reliably if she can prepare n states 
in the output ensemble U£. Just like the case of generating entanglement, n is chosen large enough to 
ensure the efficiency of steps (1) and (3). 

When describing and analyzing the protocol, we loosely call £ the optimal ensemble achieving the supre- 
mum in Eq. (p4|). For arbitrarily small e, £ is chosen so that X(tr^>U£ ) — X(tTAA'£ ) > AXy'*^ — e. Since 
how e enters the following analysis is obvious, and the analysis is independent of the choice of e and £ , e 
is omitted for simplicity. 

Protocol that achieves u = Ax^'*': 

Let £ be the optimal ensemble. If Alice is given nX(\,TAA'£ ) bits of classical communication as an initial 
resource, she can transmit k messages M i=l ^... ^ each of length n AXy (a total of n k AXy bits) with 
nk uses of U as follows: 

• Alice's preprocessing: Alice determines k messages Ni each of nX(tTAA'U£) bits. Each Ni has 2 
parts: the message Mi of length nAx[J '*\ and an RSP instruction Ri of length nX(tTA,A'£) for Bob 
to create a state G £® n such that U® n \<p i+1 ) £ {U£)® n encodes N i+1 (by the HSW Thm). In 

order to generate Ri for Ni, Alice needs to determine \<f>i+i) and to perform her measurement for the 
RSP of \(f>i+i). This in turns requires knowledge of iVi+i. So Alice first computes the last message 
Nk (in which Mk is known and Rt is irrelevant), classically calculates \<fik), performs measurement 
for the RSP of \4>k) to find out Rk-i in Nk-i, works her way backwards through Nk-i, • • ■ , Ni, 
determining \<f>i) from Ni and performing measurement for RSP for \4>i) for decreasing i. 
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• Quantum protocol: Alice uses the given initial classical communication to create \(j>i), which she 
shares with Bob. Then [/"® n is applied to convert it to U® n \<f>i), Bob reads off the message Ni, 
which consists of Ri to instruct him to do RSP for \cf>2) and so on. 

The protocol is summarized in Fig. (p6|). 



|< nXitr^US) >| 



N k = 



N l+2 -- 

HSW ^ 

u® n \<t> i+2 ) 

"t 



Ri-\ 



(if2)S 



N i+1 = 

HSW | 



lv-*+lK^SP (!+1)s 



Ni = 

HSW ^ 

I'M ^SPis 
Ro 



Mi. 



Mi+i 



M, 



Mi 



Last RSPa 
not needed 



-RSP 



(H3)A 



-RSP 



(M-2)A 



-RSPfifi 



.RSP 2 



.RSPi 



(26) 



In Fig. (|26j), RSPiA denotes Alice's RSP measurement to obtain the instruction Ri for Bob to prepare 
\<pi)- RSPis denotes Bob's conditional operation to complete the preparation of \<j>i). 

The initial amount of classical communication can be created by Alice and Bob using cn uses of U 
inefficiently, for some constant c.0 The communication rate is 

hAx ";" j^Axr. (27) 

cn + nk u K ' 

We have not yet discussed small inaccuracies and inefficiencies in the protocol. The asymptotic correct- 
ness of this protocol comes from the asymptotic reliability of its component pieces: RSP and the HSW 
Thm. However, since errors and inefficiencies accumulate over many rounds, we need to choose the rates 

4 Sec. [j| shows that any nonlocal gate U has nonzero communication capacities in both directions. 



17 



of increase of n and k slightly more carefully. 

Suppose that preparing a member of £ ® n with RSP requires n(X(tT AA /£) + 5* SF ) bits of communication 
and has error e P f p , where 5JJ; SP ,e P f p — > as n — > oo. Similarly, a state in (U£)® n provides n(X(tr AA iU£ ) — 
<5Jf w ) bits of information with error e™ w , where again (5Jf w ,e" sw — > as n — > oo. Combining these into 
8 n = 5^ SP + S" sw and e„ = e^ sp + e^ sw , we find that the communication rate is 

M ^ ! ;1;^ 1 - K" °<*- 2 >) — <-> 

and the total error is ke n . This vanishes if one chooses k first, and then chooses n such that e„fc is small 
(n thus depends on k). 

We summarize the order of the limits. First, choose the optimal ensemble £ to approximate AXy'*'. 
Second, choose k large to make c/k negligible (to overcome the initial cost). Finally choose n large to 
make both of ke n and 8 n vanish. As this protocol does not require initial mutual information, it follows 
that: 

Theorem 2 (2nd half): lim^^ Ax^' 0) > Ax^ 1 '*'. 
Putting these together gives: 

Theorem 2: Ct = lim;^ Ax^' 0) = Ax^ 1 '*' = sup £ [ x(tr AA ,U£) - X{tr AA ,£) ] . 

Thus initial mutual information does not increase the asymptotic capacity, analogous to entanglement 
generation. Finally, we generalize Theorem 2 to prove strong additivity: 

Theorem 2S: The classical communication achievable by n; uses of Ui is asymptotically 5Zi ni Ax^.'*'. 

Proof: The argument that proves Theorem 2 (1st half) can be applied to prove that the amount of 
communication generated is no more than Y2i n i^^u'* > which is achieved by applying the optimal 
protocol for each Ui separately. 

4.4 Additivity 

We conclude this section with two observations about additivity: 

• We emphasize that in Theorem 2 (1st half), the Holevo bound is applied to the output of a general 
protocol Vt with possibly entangled inputs to different uses of U. Thus the 1-way entanglement-assisted 
capacity for unitary bidirectional channels is strongly additive independent of whether the Holevo infor- 
mation X is additive or superadditive. 

• In the optimal asymptotic protocol, the n copies of U are applied to n states each chosen from the 
optimal input ensemble. Thus, entangling the inputs to different uses of U does not improve C^ j(/ . 

5 Discussion 

Despite the many similarities between generating entanglement and entanglement-assisted classical com- 
munication, there is an important difference. Communication cannot be stored and be used later. In 
particular, Alice needs to work backwards in our optimal entanglement-assisted communication protocol, 
so that the classical messages need to be known at the beginning of the protocol to share the initial cost. 
In contrast, entanglement can be stored. The optimal entanglement generation protocol can be stopped 
and resumed at arbitrary times. 
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We can generalize the first half of Theorems 1 and 2 to any other quantity which is monotonic under 
the given resources, as long as a sufficiently general input (e.g. state or ensemble) is allowed for the 
1-shot capacity. In particular, the input should possess all the properties the output may possess. If 
in addition the quantity is weakly additive or subadditive on the optimal input and weakly additive or 
superadditive on the optimal output, repeating the optimal 1-shot protocol allows the upper bound to 
be attained asymptotically, and additivity holds. 

We end this section with a discussion on the parallel versus sequential applications of bidirectional 
channels in a protocol. Note that there is no such distinction for unidirectional channels (in the absence of 
back channels) , as the output state of a given application of the channel is with the receiver and can never 
be used as an input for later uses. For bidirectional channels, there are sequential schemes that cannot 
be made parallel. For example, the protocol for entanglement-assisted 1-way classical communication in 
Sec. 4.3 cannot be made parallel. Sequential schemes are always at least as powerful as parallel ones. 



The opposite is true in the asymptotic regime, in which case any capacity of U® n (i.e. one must apply n 
copies of U in parallel) is equal to n times the capacity of U. The proof is simple - let Vt be any protocol 
that uses U sequentially. A particular i-use protocol for f/® n is to run n copies of Vt in parallel. Thus 
the i-shot capacity of [7® n is no worse than n times that of U, and equality holds. 



6 Other general bounds 

We have proved a few simple general bounds: E^ ' cc ' < log Sch(C/) < 21ogd and u )C-^-,u < 21ogd. 
We now derive other general bounds that hold for all U. We focus on the entropy of entanglement E e , 
and on the two capacities E^^ and E^ (= E^ 1 ^) since the latter is equal to many entanglement 



capacities of our interest (see Sec. 2.6) 



Bound 1: U is nonlocal <^ E^ ] > <^ C-.u > <^ C<_ lt / > 0. 



Proof: The first equivalence follows from Corollary 3.5 in Sec. 2.4. Let _Eo > be the amount of 
entanglement created by applying U to \$d) AA '\$d) bb' ■ 

Alice can send a noisy bit to Bob with the following t-use protocol. Bob inputs to all t uses of U. 

To send "0" Alice inputs to share tEo ebit with Bob. To send "1" , Alice inputs J0)a to the first use 

of U, takes the output and uses it as the input to the second use, and so on, so that their final entanglement 
is no more than log d. Thus different messages from Alice result in very different amount of entanglement 
at the end of the protocol. Using Fannes' inequality ^t], |48), tEo — logd < tlogd tr|po — pi\ + ~ 
where p; is the reduced density matrices of Bob when Alice sends i. For any e > 0, 3t such that 
Eo — e < logd tr|po — p%\ and Bob can distinguish po from pi with nonzero advantage. It means that the 
i-use protocol then simulates a noisy classical channel with nonzero capacity and C_,,c/ > 0. Obviously 
C-,jj > implies U is nonlocal. Similarly C^^u > U is nonlocal. □ 

Bound 2: E ( ® ] > C+,u- 

Proof: Suppose a t-use protocol Vt transmits n a bits from Alice to Bob and ni, bits from Bob to Alice 
with fidelity 1 — e. Recall from Sec. ^ that Vt can be assumed unitary with the ancillas starting in the state 
|0r)^/|0r)B/ where r is a share random variable. Let \x)a\v)b carry the messages to be communicated, 
where x and y are n a - and rib-bit strings. Then, by definition (Eq. (0)), the state change is given by: 

\x) A \y)B\Qr) A ,\0r) B > \rjxy) 

S.t. y x ,y F (\y) A \x) B ,tr A , B '\rjxy){r]xy\ABA'B') = i - ^xy > 1 - (29) 
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By Uhlmann's Theorem |48|], there are normalized states \c xy ) a'b' and \e xy ) aba'b' such that 



\rj xy ) = \J\ - € xy \y) a\x) B \c xy ) A'B' + y/^y \ e *y) ABA'B' , (30) 

and tXA'B'\e-xy){e, xy \ aba'b' has support orthogonal to the span of \u)a\x)b- 

To prove £$> 3 > C+. u, we simply change the inputs to the protocol so that it creates entanglement. 
Alice's input system A is now in a maximally entangled state with another ancilla A" , each with 2 n ° 
dimensions, and similarly for Bob. Thus the input state is given by 

■2- (na+nb)/21 £\x)A\x) A ,,\y) B \y) B ,,\0r) A ,\0r} B , , (31) 

xy 

where x and y are summed over their possible values. The output is given by 

xy 

= 2- ( "«+"> )/2 ^ [ ^T^7ry\y)A\x)B\c xy ) a'b' +V^\^v) -aba'b' ] \x) A "\v) B" ■ (32) 

xy 

To calculate £(|?7 (e) )), we first calculate -E(|r? (0) )) for 

| ?? (0) ) = 2- {n ^' 2 Y^\y)A\x)A"\x)B\y)B"\Cxy)A'B'. (33) 

xy 

E(\rf ^)) is simply the entropy of Alice's reduced density matrix, which can be found by the "Joint 
Entropy Theorem" (Eq. (1.58) in JTHJ ) . 

S(tr BB , B „\ V ^)( V ^\ ) = S (2-^ + ^Y.\y)^\ X )^\A" ®^B'\Cxy){Cxy\) 

xy 

= n a + n b + 2~ (na+ " b) J2 S ( tl B'\cx y )(cx V \^>n a + n b . (34) 

xy 

We now relate E(\r/ (t) }) to E(\r] W )). The inner product (r/ (()) j?/ s) ) can be bounded: 

(?7 (0) |?7 (C) ) = 2~ ( " a+ni>) ^2{y\A{x\ B {Cxy\A'B' [ \J 1 ~ txy \y) a\x) B \c x y) A 'B' + \/tx~y~ \&xy) ABA'B' ] 

xy 

= 2-^+^^yi - — y 



> s/T~e (35) 

where we have used the the orthogonality oi {\x) A"\y) b" }xy to obtain the first line, and the orthogonality 
of \y) a\x) b\cx V ) a'b' an d \e xy ) aba'b' to obtain the second line. The trace distance D between \rf ') and 
|t/ e >) is defined to be |tr| jr/ 0) )<r/ 0) | - \v (c) ) (ri (e) I I, and D(|j? (0) ), |r/ e) )) = y/l - (jjW ^M} 2 < y/l (see 
Sec. 9.2.3 of |@). Using Fannes' inequality @ or the continuity of the entropy of entanglement |}48f| . 

E(\V W )) - E(\ V (C) )) | < log ( Sch(|r/ 0) )) + Sch(jr/ E) » ) 2D(\ V m ), |^ e >» + 1/e (36) 
< (2tlogd + n a +n b ) 2^+ 1/e. (37) 

We explain how Eq. ( p^j ) is obtained. First, Schfl?/ 6 ')) < d 2 ' because the protocol has a product 
initial state and Sch(U\i>)) < Sch((7) Schfl^j)) We now bound Sch(|r; (0) )). A Schmidt decompo- 

sition of |t? (0) ) can be obtained by Schmidt decomposing each \c xy ) in Eq. ( |33| ) so that Sch(|r; ( - ^)) = 
J2 X v Sch(\cxy}). For each xy, Schdc^}) < Sch(\rjx y }) since Ic^y) can be obtained with nonzero probabil- 
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ity by locally measuring \rj xy ). Each \r) xy ) is obtained from a product initial state after t applications of 
U, and SchflT/a*)) < d 2t . Altogether, Sch(|?7 (0) }) < d 2t 2™" +n " and Sch(ir/ 0) }) + Sch(j77 (c) >) w d 2t 2 n " +n \ 

From Eq. @ and Eq. @ we can lower bound the entanglement generated per use of U : 
jE(\ V (e) )) > Un a + n b )-2V~t [2\ogd+-(n a +n b )]- - 

L L Lot/ 

As t increases, e can be made arbitrarily small and j{n a + rib) — > C?+,lr- Furthermore, 21ogd + (n a + 
rib) /t — > 21og<i + C+,!7 < 41ogd is well bounded. The above equation then implies -E^f' > C+ : u- d 

Remark: In the proof above, it is crucial to bound Sdid^'}) and Schdr/ '}) as functions of t, and our 
bound is based on having a product initial state. Furthermore, the two limits t — + oo and e — + are 
dependent. Thus one cannot assume e — > as a separate premise in the above proof and extra care is 
needed in how the limits are taken. 

After this paper was first posted, Berry and Sanders |5(J| proved that if the capacity is achievable by an 

? (0) 



exact protocol (i.e. e = 0), then Cf u < E^ + E^ 



To adapt the proof of bound 2 for Cf :U < E^+E$ in the general case when e > will require an explicit 
bound on Sch(|r/ e -')) and Sch(|r/ ^)) and knowledge of how various inaccuracies vanish asymptotically, 
so as to specify how various dependent limits should be taken. So far, we do not see how this can be 
done. 

In the following, we prove a weaker bound C!t,u 5= &u + ^ui f° r e > 0, by adapting the proof of bound 
2 and an idea from [^|, as well as using details on the optimal protocol for achieving CE± t u an d an 
improved method for RSP of bipartite pure entangled state that uses less entanglement than the method 
in Ref. fl(|. 

Before we present the proof, we give an interpretation of E^ = E^*^ as the entanglement destroying 
capacity of U 

sup [ E(\^) AA , BB ,) - E(UabWaa>bb>) ] , (38) 
Waa'bb' 

since LP creates as much entanglement on the input \ip) as U can destroy on U*\ip). Note that to 
disentangle a state unitarily is a nonlocal task. We now turn to our proof. 

Bound 3: C^<4"+^>. 

Proof: We omit details already given in the proof of bound 2. Let Vt be a unitary protocol transmitting 
n a bits from Alice to Bob with fidelity 1 — e. The ancillas are initially in the maximally entangled state 
\<&m)a'b' where logM is the amount of initial entanglement required to assist the communication. Let 
\x)a carry the n a -bit message of Alice. The state change is given by: 

\x)a\0)b\$m)a>b- \n.) (39) 



\rj x ) = y/l- t x \0)a\x)b\c x ) a >b> + V^x \c-x)aba'b' (40) 
where tr A'B'\ex){e x \ABA'B' has support orthogonal to |0)>i|a;).B. 

In the entanglement generation protocol, Alice inputs half of \&2™a) aa" while Bob still inputs \0) B . The 
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input and output states are given by 



2~ n ° /2 Y, \x)a\x)a»\Q)b\$m)a>b> and (41) 

X 

|r/ (e) ) = 2-"» /2 ^ [ VT~ T x \Q) A \x) B \c x ) A , B , + ^fT x \e x )ABA>B> ] \x) A » . (42) 

X 

For \v (0) } = 2- na/2 i: x \0}A\x} A „\x} B \c x } A , B ,, 

E(\ V m }) = S(tr ss ,|r/ 0) ><77 (0) | ) = S (2" n ^ |0><0U|a:> (x\ A „ ® tr B , \c x ) (c x 



= n a + 2 ^ S (tr B >\c x ){c x 

X 

x 

Applying the definition ol the entanglement destroying capacity to Eq. ([fo|), 

V. E ( 1 77, ) ) > log M - tE$ *° = log M - tE$ . (43) 
Since (0|a(3;|s(c ;c | j 4'b'| j 7!c)a4'ss' > VI — £ and Schdc}) < Schd^}) < Md 2t , Fannes' inequality implies 
y x | E(\c x )) - E(\r) x )) I <4Ve[logM + 2tlogd] +l/e. (44) 

Hence, 

S(|J7 (0) )) > n a + log M - £E$? - 4^7 [ log M + 2i log d ] - 1/e . (45) 

Using (r7 (0) jr/ £) ) > VT^, and that Sch(|?? (e) )) < d 2t M, Sch(] 77 (c,) ) ) < J2 X Schflc*)) < 2 na d 2t M, Fannes' 
inequality implies 

E{\rf 0) )) - E(\r/ <e) }) | < 2Vi[logM + 2tlogd + n a ] +l/e. (46) 

Thus 

S(|f7 (e>) ) > n a + log M - tE$ - 2^7 [ 3 log M + 6t log d + n a ] - 2/e , (47) 

and 

Ef > \ [ E(\ V V)) - log M ]>\n a - E$ - 2^1 [ | log M + 6 log d + ^ ] + | . (48) 



In particular, consider the entanglement-assisted communication protocol in Sec. 4.3. For any 5s > 0, 3£ 



consisting of Mo -dimensional bipartite states with x(U£) — X(£) > Axjj'*^ — 5e = C^i.u ~~ fe- Following 
Eq. dig)) , t = cn + nk for some constant c and the rate is ^ > C^jj — fie — 5 n — jC^^u, where S n — > 
as ?i — > cx) . The total error is e = ke n , where lmin^oo e„ = 0. The RSP method in jits) can be improved 
jHlJ to prepare n states from an ensemble £ with nX(tr AA i£) + o(n) cbits and nS(p B ) + o(n) < n log Mo 
ebits where pa is the average reduced density matrix of the ensemble as seen from Bob, so that M < Mq. 
Putting all these parameters into Eq. (jig), 

4 0) > - [ 5 £ + 5 n + | Ct,u ] - 4«? - 2^ [ 3 log M + 6 log d + ^ ] + (49) 

For any S > 0, choose 
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1. £ such that 8s < 8/5, 

2. k such that f C5,c/ < <5/5 and 2/e(c + fc) < 5/5 so that 2/en(c + fe) < 5/5, 

3. ra such that <5 n < 8/5 and e = fce„ small enough for l^fe [ 3 log Mo + 6 log d + ^f- ] < 8/5. D 

To summarize, for all (7, we have 

• CU,t/, C_,t/ < C* + ,c/ < < Sch(C7) < 21ogd. 

• C_,!7 < Ct,u < min(21ogd, £<, 0) CU,t7 < C5,e/ < min(21ogd, E^+E^), 
Clja < C^ v + Ct,u < min(41ogrf,2(£« + £«)). 



We now return briefly to Hamiltonian capacities. Recall from Sec. 1.5 that any Hamiltonian capacity 
can be expressed in terms of the corresponding gate capacity, Gh = lim s -»o - G u=e -iHs . The finiteness 
of Gh is not immediate from the above definition. Even though G u=e -iHs — ► when s — > due to 
continuity, it is not guaranteed that G u=e -iH S < O(s). One may argue that physically, the rate should 
be finite, but the availability of unlimited local resources complicates the argument. We now provide a 
proof of the finiteness of the Hamiltonian capacities |5^] . 

Bound 4: Hamiltonian capacities are finite. 

Proof: Recall that the entanglement capacity of a gate U is no more than the average amount of 
entanglement required to simulate U given free classical communication. Reference 0| describes a method 
to simulate U = e -1 " "*® "* with O(a) entanglement, which implies that Eh=<j z ®<j, is finite. Reference 
|io[ describes a method to simulate any other Hamiltonian H' in d x d using the Hamiltonian H = a z (&o z 
with 0(d 4 ) overhead, so that Vjj' E H i < 0(d 4 )EH=tr z ®<Tz is also finite. 

The finiteness of other capacities now follows. From the summary, H , < 2(E H i + E_ H i) is an upper 
bound to all other classical capacities of H' . □ 



7 Bidirectional channels on d\ x 

We have assumed U acting on a d x d bipartite system. We note here that all the results discussed hold 
for a nonlocal gate (or Hamiltonian) acting on a d\ x d z system (without loss of generality, d\ < (fe) . 
The interested reader can easily verify that all the arguments hold in this case, because the fact d\ = d-z 
is never used in the proofs. We also note a subtle observation, that the d\ x d-z case is not described 
by embedding the operation in a cfe x d z system by taking the direct sum with a cfe — d\ dimensional 
identity matrix acting on the side of lower dimension. 



8 Open questions and examples 

We have found expressions for the entanglement capacity and the entanglement-assisted classical capacity 
of unitary bidirectional channels, defined classical capacities for them, and provided general bounds for 
the capacities. We conclude first with a list of open questions, followed by examples to illustrate our 
results and our open questions. 
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8.1 Open questions 



• How large do the ancillas A'B' need to be in the optimal input for entanglement generation? How large 
do A'B' need to be, and how many states are needed in the optimal ensemble for entanglement-assisted 
classical communication? These are important for numerical studies of the capacities. 

• Will infinite dimensional ancillas improve the entanglement capacity and the entanglement-assisted 
1-way classical capacity? Will an ensemble with an infinite number of members improve the latter? 

• How do the forward and backward rates trade off with each other (in either the unassisted or assisted 
case)? 

• Are forward and backward classical capacities always equal (in either the unassisted or assisted case)? 

• Is there a gate U with C+,u < E^ a strict inequality? 

• Is E^^ — E^'^ for all VI Both quantities relate to how entangling a nonlocal gate is. However, we 
can only prove the equality when U = U T , by using the fact -E^V = E^^} (U* is the complex conjugate 
of U). This generalizes the proof in Ref. |H(J for 2-qubit gates since U = U T for all 2-qubit gates in 
their normal form EJ|. Numerical work suggests that the equality does not hold for some U in higher 
dimensions |53. 

• When can a gate be simulated efficiently, i.e., by an amount of some resource equal to the capacity? 

• How do auxiliary resources of quantities linear in the number of uses affect the capacity? 



8.2 Examples 

Example 1: Let U = CNOT. It can be simulated using 1 ebit and 1 bit of classical communication in each 
direction Q. Thus E^ OT < 1, CNOT ^ 1j C~,CNOT ^ 1> an d bound 2 further implies C + cnot < 
1. These are all achievable with obvious methods, without the need of entanglement assistance in 
cnot an d Cf_ cnot an d without the need of initial entanglement in -Ecnot' Therefore, C_ cnot — 

C_,CNOT = C +>C NOT = C5,CNOT = C5,CNOT = ^CNOT = £ CNOT = L The rate P airS in the triangle 

with vertices (0,0), (0,1), (1,0) are achievable without entanglement assistance, and convexity implies 
no other pair is achievable. We also have Cf cnot — 2 due to the following protocol. Starting with 
the EPR state \<&2)ab, Alice applies a£ and Bob applies a\ if their respective input bits are a and b. 
The CNOT is then applied, converting the state to ( |0) + (— l) b |l) ) \a). Thus rate pairs in the 

square with vertices (0,0), (0,1), (1,0), (1,1) are all achievable with entanglement assistance, and by 
monotonicity, no other rate pair is achievable. 

Example 2: For U = SWAP, we have the general upper bounds ^gWAP — ^ 5 C^i swap — 2, swap — 2, 
and bound 2 implies C +i swap < 2. These are all achievable as follows. SgvvAP ~ ^SWAP ' s achieved 
on the input [<f>2) AA' I $2)bb'- To achieve the forward assisted classical capacity, Alice and Bob start 
with the state \<&2) ab"\<&2) bb> an d Alice applies o°^ A a a z \ when her 2-bit message is a\a^. Then SWAP 
is applied. In other words, superdense coding [[l8] is performed, consuming an existing EPR pair on 
AB" , while a new EPR pair is created on AB' simultaneously. Thus the unassisted and assisted one-way 
classical capacities are both 2. C + SWAP = 2 is achieved in the obvious way. Superdense coding in both 
the forward and backward directions implies swap — 4, and by the monotonicity of the achievable 
region of assisted rate pairs, C? swap = 4- Therefore, any rate pair inside the triangle with vertices 
(0, 0), (0, 2), (2, 0) can be achieved without entanglement assistance, and any rate pair inside the square 
with vertices (0, 0), (0, 2), (2, 0), (2, 2) can be achieved with entanglement assistance. 
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The CNOT and SWAP are very simple. We now turn to more intriguing examples. 



Example 3: The gate J acts as 

|00>~i=(|00) + |ll)), |11)^_L(|00>-|11», |01) ~|01>, |10>~|10> 

where the first and second registers are A and B (same throughout the examples). Without ancillas, J 
creates 1 ebit but seems to create less than 1 cbit in 1-shot, but [^(j) presents a product 2-qubit input 
that communicates 1 cbit from Alice to Bob. 

Numerical optimization of the generated entanglement with 2-dimensional A' and B' in Eq. (R) is 1.83186 
ebit, and the optimal input has 0.055338 ebit. As a comparison, only 1.8113 ebit is generated by inputting 

\$2) AA>\®2) BB> ■ 

Starting from |$2)ab = ^(|00) + |H))i Alice and Bob can communicate one bit to each other, by 
applying a" and a\ if their respective messages are a and b. The J gate further converts the state to 
|x)a|:e).b where x = a + b mod 2, from which they learn each other's input. 

We suspect C +i j < E^K For instance, the best total rate we found requires creating 1 ebit with 1 use 
of J followed by assisted two-way communication in the second use of J. Asymptotically, 2.83186 uses of 
J can create at least 1.83186 x 2 bits of communication, so that C + .j > 1.2938, which is much less than 
1.83186. 

Example 4: Denote the "cyclic permutation" gate by CP. It acts as: 

\ x )\y) -» \ x )\y-t) if v t^o 

\x)\y) -» |a:— 1>| W — 1> if 2/ = 

where ± is modulo d. Sch(cp) = 2 for all d, thus E^p < 1. This is achievable on the input -^|0)a(|0) + 
|l))s- C<_ CP > 1 since Bob can send 1 bit b 6 {0,1} to Alice with the input |0)a|&)b- Thus Vd 
-^cp ~ -^cp — ^<-,cp — ^+,cp — J- 

For d = 3, we have also studied forward communication without ancillas. It is impossible to transmit 1 
bit from Alice to Bob by one use of U, but it is possible asymptotically, so that (X^cp = 1- 

Example 5: Define the gate AE on d x d by: 

V x |a;0) <-> \xx) , 

VO^t/ and y =f= x \xy) <-> \xy) . 

Since Sch((7) = d, Ej^ < logd, and this is achievable on the input J2 x \x)a\0)b, and E^ = E^^ — 
logd. C_,ae — logd is achievable in the obvious manner. Thus C + ae = logd. We can prove that one 
use of AE can communicate strictly less than log d cbit from Bob to Alice starting from product states 
but allowing ancillas. However, we suspect C^ae < C_,ae = logd. 

Example 6: Since the initial submission of this paper, Childs, Leung, Verstraete, and Vidal [^3[ have 
analytically proved that the asymptotic entanglement capacity of any Hamiltonian locally equivalent to 
a a x (g> ctx + P o y (g> a y can be achieved without ancillas, and the capacity is « 1.9123 (a + /3) following 
Ref. 1. 
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A Linear bound in communication cost for distillation 

In this appendix, we obtain a bound on the communication cost in distillation using |^2| which derives 
the enhancement factor of the capacity of a noiseless quantum channel assisted by noisy entanglement, 
i.e. unlimited supply of the mixed state p. 

Suppose given p® 9 ", cn forward classical bits (in either direction) is sufficient to distill n ebits (q > 
1/D(p)). Here, we do not require maximum yield of entanglement, so that the classical communication 
cost is upper bounded by that required in the more difficult job of distillation. 

Then, the following is a noisy superdense coding strategy for Alice and Bob - first distill and then perform 
noiseless superdense coding: 

cn cbits + p® qn —> n ebits 
n ebits + n qubits — > 2n cbits 

Together, the enhancement factor is equal to 2n ~ cn = 2 — c, which cannot exceed the optimal value 

nS(tr A (p)) - S(A A (p® n )) 



c sd = i + sup 2J 



„ Vi S(A A (tr B (p®")) 

where the supremum is taken over all trace-preserving completely positive maps Aa on Alice's half of 
p® n . Hence c > 2 — C s d = A. Even though it is not known how to calculate A for an arbitrary p it is 
unlikely to be zero for all p. If 3p for which A ^ 0, then distillation would take at least linear classical 
communication. 
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